Avg. Model Size vs. Changes No. Model Size Error vs. Changes No
Abstract
model of its opponent's strategy based on its past behavior, and uses the model to predict its future behavior. We represent an interaction between agents as a repeated game and restrict our attention to opponent strategies that can be represented by a DFA. Learning a minimal DFA without a teacher has been proved to be hard. We presented an unsupervised algorithm, US-L*, based on Angluin's L* algorithm. The algorithm maintains a model consistent with its past examples. When a new counterexample arrives, it tries to extend the model in a minimal fashion. We presented a method for constructing an optimal strategy against the learned automaton. We conducted a set of experiments in which random automata representing different strategies were generated, and the algorithm tried to learn them from a random set of game histories. The algorithm managed to learn very compact models with high accuracy. The experimental results suggest that the algorithm behaves well for random prefix-closed samples. However, following Angluin's result on the difficulty of learning almost uniform complete samples [Ang78], it is clear that our algorithm does not solve the complexity issue of inferring a DFA from a general prefix-closed sample. We are currently looking for classes of prefix-closed samples where US-L* behaves well. The work presented here is only a first step in the area of opponent modeling. The US-L* algorithm enables an adaptive player to model another agent's strategy in order to find a proper response. The tasks of modeling adaptive players, modeling players that hide their interactive strategies, or avoiding other agents' attempts to model your strategy are extremely difficult and deserve further research.

Figure 7: The average model size learned by US-L*, and the average error of the learned models, as a function of the number of changes.
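To make the model class concrete, the following is a minimal sketch of an opponent strategy represented as a DFA with one output action per state, together with the consistency check the abstract mentions ("maintains a model consistent with its past examples"). It is not the US-L* learning procedure itself. The class name `MooreMachine`, the helper `observations`, and the Tit-for-Tat example are illustrative assumptions, not taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MooreMachine:
    """Hypothetical opponent model: a DFA with one output action per state."""
    start: str
    output: dict  # state -> action the opponent plays in that state
    delta: dict   # (state, my_action) -> next state

    def run(self, my_actions):
        """Return the state reached after the opponent has seen my_actions."""
        state = self.start
        for action in my_actions:
            state = self.delta[(state, action)]
        return state

    def predict(self, my_actions):
        """Predict the opponent's next action given my action history."""
        return self.output[self.run(my_actions)]

    def is_consistent(self, sample):
        """True if the model reproduces every (my_history, opponent_action) pair."""
        return all(self.predict(hist) == act for hist, act in sample)


def observations(my_actions, opp_actions):
    """Turn one game history into a prefix-closed list of examples.

    Example t pairs my first t actions with the opponent's action in round t.
    """
    return [(tuple(my_actions[:t]), opp_actions[t])
            for t in range(len(opp_actions))]


# Illustrative model: Tit-for-Tat (cooperate first, then copy my last move).
TFT = MooreMachine(
    start="c",
    output={"c": "C", "d": "D"},
    delta={("c", "C"): "c", ("c", "D"): "d",
           ("d", "C"): "c", ("d", "D"): "d"},
)

# Illustrative alternative model: always cooperate.
ALL_C = MooreMachine(start="c", output={"c": "C"},
                     delta={("c", "C"): "c", ("c", "D"): "c"})

sample = observations("CDDC", "CCDD")
print(TFT.is_consistent(sample), ALL_C.is_consistent(sample))
```

A sample built this way is prefix-closed by construction, because every prefix of an observed history is itself an observed example. A learner in the spirit described above would keep the current model while `is_consistent` holds and revise it only when a new example contradicts it.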
In the third experiment we tested US-L* with non-random automata. We repeated the famous tournament run by Axelrod [Axe84] for the Iterated Prisoner's Dilemma (IPD) game and allowed our adaptive player to observe the tournament and build models of all the attendees. In the original tournament, fifteen attendees (strategies) competed in a round-robin tournament, where each interaction consisted of 200 repetitions of the PD game. In our tournament we allowed only deterministic players to participate (10 players). After building the models, the adaptive player joined the tournament and played against the …
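As a rough illustration of one such 200-round interaction, the sketch below plays a fixed policy against a deterministic automaton and also computes the best achievable total against that automaton by backward induction. The payoff values (5, 3, 1, 0) are the commonly used Axelrod values and are an assumption here, since the excerpt does not state them. The undiscounted finite-horizon objective and the names `play` and `best_total` are likewise illustrative choices, not the paper's method.

```python
# Assumed payoffs to "me" for (my_action, opponent_action); not given in the text.
PAYOFF = {("C", "C"): 3, ("C", "D"): 0, ("D", "C"): 5, ("D", "D"): 1}
ACTIONS = ("C", "D")

# Illustrative opponent automaton: Tit-for-Tat.
TFT = {
    "start": "c",
    "output": {"c": "C", "d": "D"},
    "delta": {("c", "C"): "c", ("c", "D"): "d",
              ("d", "C"): "c", ("d", "D"): "d"},
}


def play(machine, policy, rounds=200):
    """Total payoff of `policy` (history -> action) against the automaton."""
    state, total, history = machine["start"], 0, []
    for _ in range(rounds):
        theirs = machine["output"][state]
        mine = policy(history)
        total += PAYOFF[(mine, theirs)]
        history.append((mine, theirs))
        state = machine["delta"][(state, mine)]
    return total


def best_total(machine, rounds=200):
    """Best undiscounted total against the automaton, by backward induction."""
    states = list(machine["output"])
    value = {s: 0 for s in states}  # value with zero rounds remaining
    for _ in range(rounds):
        value = {
            s: max(PAYOFF[(a, machine["output"][s])]
                   + value[machine["delta"][(s, a)]]
                   for a in ACTIONS)
            for s in states
        }
    return value[machine["start"]]


print(play(TFT, lambda h: "C"))  # always cooperate
print(play(TFT, lambda h: "D"))  # always defect
print(best_total(TFT))           # best response over 200 rounds
```

Under these assumptions, the best response to Tit-for-Tat cooperates until the final round and defects once at the end, which is why its total slightly exceeds that of unconditional cooperation.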
Publication date: 2011